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Abstract 

Background: Nine subgenotypes from genotype B liave been identified for liepatitis B virus (HBV). However, tliese 
subgenotypes were less conclusive as they were often designated based on a few representative strains. In 
addition, subgenotype B6 was designated twice for viruses of different origin. 

Methods: All complete genome sequences of genotype B HBV were phylogenetically analyzed. Sequence 
divergences between different potential subgenotypes were also assessed. 

Results: Both phylogenetic and sequence divergence analyses supported the designation of subgenotypes Bl, B2, 
B4, and B6 (from Arctic). However, sequence divergences between previously designated B3, B5, B7, B8, B9 and 
another B6 (from China) were mostly less than 4%. In addition, subgenotype B3 did not form a monophyly. 

Conclusion: Current evidence failed to classify original B5, B7, B8, B9, and B6 (from China) as subgenotypes. Instead, 
they could be considered as a quasi-subgenotype B3 of Southeast Asian and Chinese origin. In addition, previously 
designated B6 (from Arctic) should be renamed as B5 for continuous numbering. This novel classification is well 
supported by both the phylogeny and sequence divergence of >4%. 
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Background 

By comparing 18 HBV genomes, Okamoto et al pro- 
posed that sequence divergence of>8% over the entire 
genome should be used for HBV genotyping in 1988 [1]. 
Based on this, four genotypes were identified as A, B, C, 
and D. So far, at least eight genotypes, from A to H, have 
been identified and widely accepted. Genotypes B and C 
are prevalent in Asia and co-infection and/or super- 
infection lead to frequent recombination between these 
two genotypes [2,3]. 

Based on the rule that different subgenotypes should di- 
verge by at least 4% over the genome [4], genotypes A [5], 
B [6], C [7], D [8], and F [9] have been reported to have 
evolved into various subgenotypes [10]. Genotype B was 
initially divided into two subgenotypes: Bj (j for Japan) and 
Ba (a for Asia) [2]. Viruses of subgenotype Bj are not 
recombinants, while those of subgenotype Ba are B/C 
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recombinants, with their preC-C genes coming from 
genotype C [2]. In 2004, Bj was renamed as Bl and Ba was 
renamed as B2 [11]. In this report, subgenotypes B3 and 
B4 were also described. Subgenotype B3 was composed of 
four strains from Indonesia, while subgenotype B4 mostly 
comprised strains from Vietnam and France [11]. Subge- 
notype B5 was initially reported in 2006 from the Philip- 
pines [12]. A few months later, Sakamoto et al. found a 
few viruses from the Philippines that differed from subge- 
notypes Bl to B4, and also designated it as a novel subge- 
notype, B5 [13]. Subgenotype B6 was identified in 2007 
from arctic indigenous populations [3]. In particular, just 
as with subgenotype Bl, viruses of B6 were not recombi- 
nants. This observation led Sakamoto et al. to classify 
genotype B into two types, non-recombinant (Bl and B6) 
and recombinant (B2 to B5) [3]. Viruses of B7 were iso- 
lated from the Nusa Tenggara islands in Eastern Indonesia 
from 2008 [14]. Subsequently, subgenotype B8 was also 
identified in Indonesia by analyzing a large cohort of pa- 
tient samples [15]. More recently, viruses of subgenotype 
B9 were isolated from the same region in Indonesia where 
subgenotype B8 was isolated [6]. This suggested that the 
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Figure 1 A schematic phiylogenetic tree constructed using all genotype B HBV sequences. 



distribution of HBV subgenotypes might relate to the eth- 
nic origin of the infected patients [6]. 

However, previous studies just used a number of 
selected representative strains to perform phylogenetic 
analysis and estimate sequence divergence. Hence, the 
designation of novel HBV subgenotypes was less conclu- 
sive. In this study, we reanalyzed all of the full-length 



genome sequences of genotype B using phylogenetic 
analysis. This analysis led to the proposal of a novel and 
consistent classification for HBV of genotype B. 

Methods 

We have previously analyzed a total of 3471 full length 
genome sequences of HBV using a phylogenetic approach 



Table 1 Mean nucleotide sequence divergences over the complete genome sequences of HBV between and within 
subgenotypes B6, 81, B4 and B2 

Subgenotypes 86 B1 B4 Within group sequence divergence 

B6 0.027 ±0.002 

B1 0.062 ±0.004 0.025 ±0.001 

B4 0.074 ±0.005 0.055 ±0.003 0.025 ±0.001 

82 0.071 ± 0.004 0.047 ± 0.003 0.039 ± 0.003 0.01 7 ± 0.001 
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Table 2 Mean nucleotide sequence divergences over the complete genome sequences of HBV between previously 
designated subgenotypes in cluster 3 



Groups 


B5 




B6' 




B3-1 




87 




B3-2 




B9 




B3-3 




B8 


B6' 


0.039 ± 


: 0.003 




























B3-1 


0.030 ± 


: 0.002 


0.028: 


± 0.003 
























87 


0.035 ± 


: 0.002 


0.033 : 


± 0.003 


0.01 9 ± 


: 0.002 




















B3-2 


0.037 ± 


: 0.002 


0.035 : 


± 0.003 


0.023 ± 


: 0.002 


0.025 


±0.002 
















B9 


0.033 ± 


: 0.002 


0.034: 


± 0.003 


0.025 ± 


: 0.002 


0.030 


±0.002 


0.032 : 


± 0.002 












B3-3 


0.040 ± 


: 0.002 


0.040: 


± 0.003 


0.033 ± 


: 0.002 


0.037 


±0.002 


0.039: 


± 0.002 


0.037: 


±0.002 








B8 


0.038 ± 


: 0.002 


0.038: 


± 0.003 


0.028 ± 


: 0.002 


0.033 


±0.002 


0.035 : 


± 0.002 


0.033 : 


±0.002 


0.038: 


± 0.002 




Bx 


0.045 ± 


: 0.003 


0.046: 


± 0.004 


0.038 ± 


: 0.003 


0.042 


±0.003 


0.042 : 


± 0.003 


0.042 : 


±0.003 


0.048: 


± 0.003 


0.045 ±0.003 



[16]. Our results showed that 860 sequences belonged to 
genotype B. These sequences were selected to generate a 
new dataset for further analysis. In addition, a sequence of 
genotype C (GenBank:EU939604) was also included in the 
new dataset and used as an outgroup. Information of these 
sequences, such as subgenotype and recombination, was 
also extracted from GenBank annotations. For sequences 
with a reference available in Pubmed, we performed an ex- 
tensive literature review to obtain their subgenotype and 
recombination information. This information was used to 
define the subgenotypes. 

Phylogenetic analysis of the genotype B strains was 
carried out using RAxML [17] under the GTRCAT ap- 
proximation [18] and random starting trees. Three thou- 
sand rapid bootstrap replicates were performed and all 
other parameters were set to default. Trees were visua- 
lized and analyzed using Dendroscope [19]. The tree is 
available as Additional file 1. 

The mean nucleotide divergence (mean ± SD) between 
different subgenotypes was calculated using Mega 5 [20] 
under the Kimura 2-parameter model [21]. Five hundred 
bootstrap replicates were applied to obtain consistent 
and reliable sequence divergence values. 

Results 

Phylogenetic analysis of all HBV genotype B full-length 
genome sequences revealed five distinct major clusters. 
They were named as cluster 1 to 5 from top to bottom 
(Figure 1). Most of the clusters were well supported with 
high bootstrap values. Cluster 1 was composed of 27 
viruses belonging to subgenotype B6 [3], with a boot- 
strap value of 100%. 37 viruses of cluster 2 were isolated 
from Japan and were designated as subgenotype Bl 
[2,11]. The bootstrap value for this cluster was 92%. 46 
viruses of subgenotype B4 [11] constituted cluster 4, 
with a bootstrap value of 98%. 

Phylogenetic analysis grouped 655 subgenotype B2 
viruses into cluster 5 [2,11], representing approximately 
76% of all genotype B strains. All viruses of this cluster 
were B/C recombinants, with the great majority of the 



strains isolated from Asia (Additional file 1). However, 
the bootstrap value for this cluster was only 46%. 

Apart from the sequence divergence between B4 and 
B2 (3.9 ± 0.3%), sequence divergences between any other 
two subgenotypes (B6, Bl, B4 and B2) were greater than 
4% (Table 1). Furthermore, the within subgenotype 
divergences of these four subgenotypes were below 4% 
(Table 1). 

Cluster 3 was also well supported with a bootstrap value 
of 72%. Unlike the described clusters composed of a 
single subgenotype, cluster 3 included several previously 
reported subgenotypes, such as B5, B3, B9, B8 and B7 
(Figure 1). In addition, a few Chinese strains named as B6 
[22] were included into cluster 3. Due to the existence of 
another subgenotype B6 [3], the Chinese strains were ten- 
tatively renamed as B6'. Apart from the previously desig- 
nated subgenotypes, two sequences isolated from China 
(GenBank:AY800391 and GenBank:AY800392) have not 
been designated a subgenotype and were tentatively 
named Bx. In particular, viruses of subgenotype B3 did not 
form a monophyly. Instead, subgenotype B3 viruses were 
scattered in three parts, B3-1, B3-2 and B3-3 (Figure 1). 

To resolve the aforementioned problems, we calcu- 
lated sequence divergences between B5, B3-1, B3-2, B3- 
3, B9, B8, B7, B6' and Bx (Table 2). Apart from Bx, se- 
quence divergence between any two of these potential 
subgenotypes was usually below 4% (Table 2). Although 
sequence divergences between Bx and other potential 
subgenotypes were mostly greater than 4%, the phyl- 
ogeny did not support Bx to be a novel subgenotype in 
that it did not form a monophyly compared to other po- 
tential subgenotypes in this cluster (Figure 1). Therefore, 

Table 3 Mean nucleotide sequence divergences over the 
complete genome sequences of HBV between quasi- 
subgenotype B3 and other subgenotypes 

newBS B1 B4 B2 

(previous B6) 

quasi-subgenotype B3 6.1 ±0.4% 6.8 ±0.3% 5.8 ±0.3% 5.8 ±0.3% 
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Figure 2 The novel classification of subgenotypes for genotype B HBV. 



sequence divergences did not support the designation of 
subgenotypes B5, B9, B8, B7, B6] and Bx. 

Discussion 

Previous studies have shown that different HBV geno- 
types and subgenotypes may cause differences in disease 
progression, response to anti-viral treatment regimens 
and in clinical outcomes [4,10,23]. Therefore, the accur- 
ate classification of HBV is important. To resolve the 
problems in HBV subgenotyping and to propose a con- 
sistent classification for genotype B HBV, we analyzed 
860 complete genome sequences of genotype B using 
phylogenetic analysis. 

Phylogenetic analysis showed that genotype B HBV 
has evolved into five major clusters. Four of them corre- 
sponded to B6 (cluster 1), Bl (cluster 2), B4 (cluster 4) 
and B2 (cluster 5). Apart from subgenotype B2, the 
remaining three subgenotypes were well supported by 
both high bootstrap values and sequence divergences 
of>4%. Therefore, subgenotypes B6, Bl and B4 were 
properly designated. Although the bootstrap value for 
subgenotype B2 was only 46% and the sequence diver- 
gence between subgenotypes B4 and B2 was less than 
4%, since subgenotype B2 has been widely accepted, it 
should be maintained in order to avoid more confusion. 

However, our results failed to support the designation 
of subgenotypes B5, B3, B9, B8, B7 and B6'. First, subge- 
notype B3 was not a monophyly, but scattered in the 
tree. This was against the rule that an HBV genotype 
and subgenotype should be a monophyly [24]. Second, 



sequence divergences between the above subgenotypes 
were mostly less than 4%. This also did not support 
them to be separate subgenotypes. 

Instead, considering that viruses in this cluster are all 
isolated from Southeast Asia and China and this cluster 
is well supported by a high bootstrap value of 72%, we 
proposed that a quasi-subgenotype B3 should be used 
for cluster 3 according to previous reports, in which the 
definition of quasi-subgenotype has been used to re- 
solve the inconsistency in the subgenotyping of geno- 
type A HBV [24,25]. Sequence divergences between the 
quasi-subgenotype B3 and B6, Bl, B4 and B2 were 
6.1 ± 0.4%, 6.8 ± 0.3%, 5.1 ± 0.3% and 5.1 ± 0.3%, respect- 
ively (Table 3). In addition, the within subgenotype di- 
vergence of quasi-subgenotype B3 was 3.1 ±0.1%, less 
than 4% to differentiate a subgenotype. 

Taken together, we corrected the incongruence in the 
classification of genotype B HBV and proposed a con- 
sistent classification for genotype B based on a phylo- 
genetic analysis of all genotype B HBV complete 
genome sequences (Figure 2). In this classification, ori- 
ginally designated B3, B5, B7, B8, B9 and B6' (sequences 
from China) comprised a quasi-subgenotype B3. For 
continuous numbering, the subgenotype B6 has been 
renamed as new B5. This classification is well supported 
by both phylogenetic analysis and sequence divergences. 
To avoid misclassification of HBV subgenotypes, we in- 
sist that the rules [24,26] proposed to define novel geno- 
types and subgenotypes should be strictly obeyed. 
In addition, if possible, novel genotype/subgenotype 
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should be made following a complete comparison of all 
relevant sequences rather than with a few representative 
sequences. 

Conclusions 

Our results revealed that subgenotypes Bl, B2, B4, and 
B6 (from Arctic) have been properly designated and 
should be maintained in the new classification. However, 
sequence divergences between previously designated B3, 
B5, B7, B8, B9 and another B6 (from China) were mostly 
less than 4%. In addition, subgenotype B3 did not form a 
monophyly. Therefore, current evidence failed to classify 
original B5, B7, B8, B9, and B6 (from China) as subgeno- 
types. Instead, they could be considered as a quasi- 
subgenotype B3 of Southeast Asian and Chinese origin. 
Moreover, previously designated B6 (from Arctic) should 
be renamed as B5 for continuous numbering. This novel 
classification is well supported by both the phylogeny 
and sequence divergence of > 4%. 

Additional file 



Additional file 1: The phylogenetic tree constructed using all 
genotype B HBV sequences. 
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